Maintaining Predictions over Time without a Model

نویسندگان

  • Erik Talvitie
  • Satinder P. Singh
چکیده

A common approach to the control problem in partially observable environments is to perform a direct search in policy space, as defined over some set of features of history. In this paper we consider predictive features, whose values are conditional probabilities of future events, given history. Since predictive features provide direct information about the agent’s future, they have a number of advantages for control. However, unlike more typical features defined directly over past observations, it is not clear how to maintain the values of predictive features over time. A model could be used, since a model can make any prediction about the future, but in many cases learning a model is infeasible. In this paper we demonstrate that in some cases it is possible to learn to maintain the values of a set of predictive features even when a learning a model is infeasible, and that natural predictive features can be useful for policy-search methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Vapor Deposition Techniques for Membrane Pore Size Modification

The suitability of three vapor deposition techniques for pore size modification was evaluated using polycarbonate track etched membranes as model supports. A feature scale model was employed to predict the pore geometry after modification and the resulting pure water flux. Physical vapor deposition (PVD) and pulsed plasma-enhanced chemical vapor deposition (PECVD), naturally, form asymmetric na...

متن کامل

A synoptic-climatology approach to increase the skill of numerical weather predictions over Iran

Simplifications used in regional climate models decrease the accuracy of the regional climate models. To overcome this deficiency, usually a statistical technique of MOS is used to improve the skill of gridded outputs of the Numerical Weather Prediction (NWP) models. In this paper, an experimental synoptic-climatology based method has been used to calibrate, and decrease amount of errors in GFS...

متن کامل

Comparison of Performance of Traditional Value at Risk Models with Switching Model in Tehran Stock Exchange

The problem of portfolio optimization has made many advances since Markowitz proposed an average-variance-based optimization. It can be said that the most important achievement of the Markowitz model was the introduction of variance as a risk indicator and indeed, the introduction of a quantitative benchmark into it. This research is a model for predicting value at risk. This model extends the ...

متن کامل

شبکه عصبی مصنوعی برای ارزیابی خطر اختلالات حرکتی در نوزادان

 Background: Prediction of developmental disorders in infancy is very important. This study aimed to predict movement disorders of children using Artificial Neural Network (ANN) model. Methods: This was a retrospective study, in which 600 infants with normal and 120 infants with abnormal neurologic examination were evaluated. For analysis, the data divided the study group randomly int...

متن کامل

Film cooling effectiveness in single row of holes: First moment closure modeling

The present article focuses on the evaluation of a first-moment closure model applicable to film cooling flow and heat transfer computations. The present first-moment closure model consists of a higher level of turbulent heat flux modeling in which two additional transport equations for temperature variance kθ and its dissipation rate εθ are ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009